AITopics | Support Vector Machines

Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu 1, P. R. Kumar 1

Neural Information Processing SystemsMay-29-2025, 10:28:12 GMT

Motivated by the problem of learning with small sample sizes, this paper shows how to incorporate into support-vector machines (SVMs) those properties that have made convolutional neural networks (CNNs) successful. Particularly important is the ability to incorporate domain knowledge of invariances, e.g., translational invariance of images. Kernels based on the maximum similarity over a group of transformations are not generally positive definite. Perhaps it is for this reason that they have not been studied theoretically. We address this lacuna and show that positive definiteness indeed holds with high probability for kernels based on the maximum similarity in the small training sample set regime of interest, and that they do yield the best results in that regime. We also show how additional properties such as their ability to incorporate local features at multiple spatial scales, e.g., as done in CNNs through max pooling, and to provide the benefits of composition through the architecture of multiple layers, can also be embedded into SVMs. We verify through experiments on widely available image sets that the resulting SVMs do provide superior accuracy in comparison to well-established deep neural network benchmarks for small sample sizes.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Brazos County > College Station (0.14)

Genre: Research Report > Experimental Study (0.54)

Industry: Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)

Add feedback

Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu 1, P. R. Kumar 1

Neural Information Processing SystemsMay-29-2025, 10:28:08 GMT

Motivated by the problem of learning with small sample sizes, this paper shows how to incorporate into support-vector machines (SVMs) those properties that have made convolutional neural networks (CNNs) successful. Particularly important is the ability to incorporate domain knowledge of invariances, e.g., translational invariance of images. Kernels based on the maximum similarity over a group of transformations are not generally positive definite. Perhaps it is for this reason that they have not been studied theoretically. We address this lacuna and show that positive definiteness indeed holds with high probability for kernels based on the maximum similarity in the small training sample set regime of interest, and that they do yield the best results in that regime. We also show how additional properties such as their ability to incorporate local features at multiple spatial scales, e.g., as done in CNNs through max pooling, and to provide the benefits of composition through the architecture of multiple layers, can also be embedded into SVMs. We verify through experiments on widely available image sets that the resulting SVMs do provide superior accuracy in comparison to well-established deep neural network benchmarks for small sample sizes.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Brazos County > College Station (0.14)

Genre: Research Report > Experimental Study (0.54)

Industry: Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.70)

Add feedback

Supplement: Novel Upper Bounds for the Constrained Most Probable Explanation Task

Neural Information Processing SystemsMay-28-2025, 21:18:43 GMT

It is well known that any MPE task can be encoded as an integer linear programming (ILP) problem (cf. A popular or widely used formulation is to associate a Boolean variable with each entry in each function of the log-linear model. When the Boolean variable is assigned the value 1, the entry is selected, otherwise it is not. For instance, a type of consistency constraint encodes the restriction that only entry from each function must be selected. A second type of consistency constraint ensures that if two functions share a variable then only entries which assign the shared variable to the same value are selected.

artificial intelligence, machine learning, upper bound, (13 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.32)

Add feedback

On the Error Resistance of Hinge Loss Minimization

Neural Information Processing SystemsMay-28-2025, 20:39:17 GMT

Commonly used classification algorithms in machine learning, such as support vector machines, minimize a convex surrogate loss on training examples. In practice, these algorithms are surprisingly robust to errors in the training data. In this work, we identify a set of conditions on the data under which such surrogate loss minimization algorithms provably learn the correct classifier. This allows us to establish, in a unified framework, the robustness of these algorithms under various models on data as well as error. In particular, we show that if the data is linearly classifiable with a slightly non-trivial margin (i.e. a margin at least C / d for d-dimensional unit vectors), and the class-conditional distributions are near isotropic and logconcave, then surrogate loss minimization has negligible error on the uncorrupted data even when a constant fraction of examples are adversarially mislabeled.

artificial intelligence, assumption, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > New York > New York County > New York City (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)

Add feedback

2974788b53f73e7950e8aa49f3a306db-Supplemental.pdf

Neural Information Processing SystemsMay-28-2025, 19:57:00 GMT

algorithm, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
North America (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

2974788b53f73e7950e8aa49f3a306db-Paper.pdf

Neural Information Processing SystemsMay-28-2025, 19:56:53 GMT

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
North America (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

29586cb449c90e249f1f09a0a4ee245a-Paper.pdf

Neural Information Processing SystemsMay-28-2025, 19:52:22 GMT

artificial intelligence, dimension reduction, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.68)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning in High Dimensional Spaces (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)

Add feedback

2e37e56599e3f49cc899f40ae4f5d1fa-Paper-Conference.pdf

Neural Information Processing SystemsMay-28-2025, 18:53:16 GMT

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Add feedback

Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests

Neural Information Processing SystemsMay-28-2025, 18:12:17 GMT

Multiple Instance Learning (MIL) is a sub-domain of classification problems with positive and negative labels and a "bag" of inputs, where the label is positive if and only if a positive element is contained within the bag, and otherwise is negative. Training in this context requires associating the bag-wide label to instance-level information, and implicitly contains a causal assumption and asymmetry to the task (i.e., you can't swap the labels without changing the semantics). MIL problems occur in healthcare (one malignant cell indicates cancer), cyber security (one malicious executable makes an infected computer), and many other tasks. In this work, we examine five of the most prominent deep-MIL models and find that none of them respects the standard MIL assumption. They are able to learn anticorrelated instances, i.e., defaulting to "positive" labels until seeing a negative counter-example, which should not be possible for a correct MIL model.

artificial intelligence, assumption, machine learning, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland (0.28)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Add feedback

Horospherical Decision Boundaries for Large Margin Classification in Hyperbolic Space

Neural Information Processing SystemsMay-28-2025, 16:07:44 GMT

Hyperbolic spaces have been quite popular in the recent past for representing hierarchically organized data. Further, several classification algorithms for data in these spaces have been proposed in the literature. These algorithms mainly use either hyperplanes or geodesics for decision boundaries in a large margin classifiers setting leading to a non-convex optimization problem. In this paper, we propose a novel large margin classifier based on horospherical decision boundaries that leads to a geodesically convex optimization problem that can be optimized using any Riemannian gradient descent technique guaranteeing a globally optimal solution.

artificial intelligence, hyperbolic space, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.68)

Add feedback

Filters

Collaborating Authors

Support Vector Machines

Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu 1, P. R. Kumar 1

Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales Tao Liu 1, P. R. Kumar 1

Supplement: Novel Upper Bounds for the Constrained Most Probable Explanation Task

On the Error Resistance of Hinge Loss Minimization

2974788b53f73e7950e8aa49f3a306db-Supplemental.pdf

2974788b53f73e7950e8aa49f3a306db-Paper.pdf

29586cb449c90e249f1f09a0a4ee245a-Paper.pdf

2e37e56599e3f49cc899f40ae4f5d1fa-Paper-Conference.pdf

Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests

Horospherical Decision Boundaries for Large Margin Classification in Hyperbolic Space